Auditory morphing based on an elastic perceptual distance metric in an interference-free time-frequency representation
نویسندگان
چکیده
An elastic spectral distance measure based on a F0 adaptive pitch synchronous spectral estimation and selective elimination of periodicity interferences, that was developed for a high-quality speech modification procedure STRAIGHT [1], is introduced to provide a basis for auditory morphing. The proposed measure is implemented on a low dimensional piecewise bilinear time-frequency mapping between the target and the original speech representations. A preliminary test results of morphing emotional speech samples indicated that proposed procedure provides perceptually monotonic and high-quality interpolation and extrapolation of CD quality speech samples.
منابع مشابه
Exploration of the other aspect of vocoder revisited: A-Z STRAIGHT, TANDEM-STRAIGHT and morphing
This article presents a tutorial information about STRAIGHT and TANDEM-STRAIGHT, a widely used speech modification tool and its successor as well as their application for speech morphing. They share the same concept that periodic excitation found in voiced sounds is an efficient mechanism for transmitting underlying smooth time-frequency representation. They also based on perceptual equivalence...
متن کاملExploration of the other aspect of Vocoder revisited ,
This article presents a tutorial information about STRAIGHT and TANDEM-STRAIGHT, a widely used speech modification tool and its successor as well as their application for speech morphing. They share the same concept that periodic excitation found in voiced sounds is an efficient mechanism for transmitting underlying smooth time-frequency representation. They also based on perceptual equivalence...
متن کاملمدلسازی بازشناسی واجی کلمات فارسی
Abstract of spoken word recognition is proposed. This model is particularly concerned with extraction of cues from the signal leading to a specification of a word in terms of bundles of distinctive features, which are assumed to be the building blocks of words. In the model proposed, auditory input is chunked into a set of successive time slices. It is assumed that the derivation of the underly...
متن کاملExemplar-based Voice Quality Analysis and Control using a High Quality Auditory Morphing Procedure based on STRAIGHT
This paper tries to introduce a new strategy and tools for voice quality research that complements conventional approaches. A very high-quality speech analysis, modification and synthesis procedure STRAIGHT, which is basically a channel VOCODER based on a pitch-synchronous analysis synthesis framework, was extended to implement auditory morphing in terms of spectral, pitch and voice quality par...
متن کاملPerformance of the Wavelet Transform-Neural Network Based Receiver for DPIM in Diffuse Indoor Optical Wireless Links in Presence of Artificial Light Interference
Artificial neural network (ANN) has application in communication engineering in diverse areas such as channel equalization, channel modeling, error control code because of its capability of nonlinear processing, adaptability, and parallel processing. On the other hand, wavelet transform (WT) with both the time and the frequency resolution provides the exact representation of signal in both doma...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003